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ABSTRACT 



Context. Astronomic line mapping with single-pixel heterodyne instruments is usually performed in an on-the-fly (OTF) or a raster- 
mapping mode depending on the capabilities of the telescope and the instrument. The observing efficiency can be increased by 
combining several source-point integrations with a common reference measurement. This is implemented at many telescopes, but a 
thorough investigation of the optimum calibration of the modes and the best way of performing these observations is still lacking. 
Aims. We derive optimum mapping strategies and the corresponding calibration schemes based on the known instrumental perfor- 
mance in terms of system stability and slew times. 

Methods. We use knowledge of the instrumental stability obtained by an Allan variance measurement to derive a mathematical for- 
malism for optimizing the setup of mapping observations. Special attention has to be paid to minimizing of the impact of correlated 
noise introduced by the common OFF integrations and to the correction of instrumental drifts. Both aspects can be covered using a 
calibration scheme that interpolates between two OFF measurements and an appropriate OFF integration time. 
Results. The total uncertainty of the calibrated data consisting of radiometric noise and drift noise can be minimized by adjusting the 
source integration time and the number of data points observed between two OFF measurements. It turns out that OTF observations 
are very robust. They provide a low relative noise, even if their setup deviates considerably from the optimum. Fast data readouts 
are often essential to minimize the drift contributions. In particular, continuum measurements may be easily spoiled by instrumental 
drifts. The main drawback of the described mapping modes is the limited use of the measured data at different spatial or spectroscopic 
resolutions obtained by additional rebinning. 

Key words. Methods: data analysis - Methods: statistical 



1. Introduction 

Mapping of astronomical objects with single pixel receivers re- 
quires a dynamical scanning of the object with the telescope, so 
that different coordinates are observed at different times. The ob- 
serving scheme is complicated by the fact that all astronomical 
receivers are affected by gain instabilities (see e.g. lKrauslll980t 
Rohlfs & Wilson, 1986), so that the sensitivity is a function of 
time as well. This dilemma can be solved by regular observa- 
tions of a reference position on short time scales compared to 
the drift time scale. Thus all mapping schemes include a short 
loop for source-reference measurements and a longer timescale 
for scanning the whole ma fflOne can distinguish between sym- 
metric observing modes, where one reference measurement is 
done for each source point, using equal integration times in both 
phases, and asymmetric modes where a reference measurement 
is done only after observing a number of source map points. 
Examples of symmetric modes are dual-beam switch raster maps 
or frequency-switch on-the-fly maps. Each point can be treated 
individually and the optimum timing can be c omputed following 
the formalism developed in Ossenkopf (2008, paper I). 



1 Similar approaches are required to map objects with array receivers 
that cover only a part of the object, but there information from dif- 
ferent pixels can be combined to quantify drifts leading to mo r e flex- 
ible and efficient obs erving schemes (see e.g. lEmerson et al 1 119791 : 
iReichertz e"t~alll2001h . 



Asymmetric modes are position-switch on-the-fly (OTF) 
map^| and asymmetric raster maps. In an OTF map, the tele- 
scope continuously scans the area to be mapped, while the de- 
tector integrates and data are read out at a high rate. Every in- 
dividual data dump represents one point on the map, and their 
distance is determined by the readout rate and the scan veloc- 
ity. After a finite number of points, defining one scan, the tele- 
scope slews to a position free of emission (OFF) for the refer- 
ence measurement and then returns to the map for the next scan. 
The theoretica l foundations for effic i ent OTF mapp i ng sch emes 
were laid by IMangum et all d200Q|) ; iBeuther et ail (|2000), and 
Schi eder & Kramer! (|2001|) . Asymmetric raster maps are simi- 
lar to OTF maps except that dead times occur when the tele- 
scope moves between different points on a map. Because of the 
similarity we restrict ourselves here to analysis of OTF maps 
and discuss the differ ences for raster maps only in Appendix C. 
IMangum et al.l d2000t) has shown that OTF mapping is a very ef- 
ficient mode for the observation of large fields in the sky with 
single-pixel receivers. The high efficiency results from the con- 
tinuous scanning and integration avoiding dead times between 
the observation of adjacent points and from the reuse of the ob- 
servation of a single reference position for the calibration of sev- 
eral data points. The OTF mapping imposes, however, harder re- 
quirements to the pointing and timing behavior of the telescope, 
which may not always be fulfilled. 



2 Through the rest of the paper we will use the term OTF map syn- 
onymous for position-switch on-the-fly maps. 
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ISchieder & Kramer! (12001 1) show that the knowledge of the 
system stability can be used to derive the optimum approach for 
performing actual observations. They computed the timing pa- 
rameters that provide the minimum uncertainty of the calibrated 
data, composed of radiometric noise and drift noise, per unit ob- 
serving time. Unfortunately, their computations were restricted 
to fluctuations with an 1 jf a power spectrum with a spectral in- 
dex a of 2 and 3. When designing the mapping observing modes 
for HIFI, the heterodyne instrument of the Herschel Space 
Observatory, to be launched in 2008 (Ide Graauw & Hel mich, 
2000), we measured , howe ver, a wide variety of spectral drift 
indices (lOssenkopfl [2008). We noticed that it is important 
to distinguish between spectroscopic drifts, which character- 
ize the variation of spectra after a zero-order baseline subtrac- 
tion, i.e. after the correction for fluctuations that affect all chan- 
nels in the same way, and total-power drifts, which character- 
ize the variation of the raw spectra, thus being typically dom- 
inated by fluctuations of the overall gain of the instrument. 
Spectroscopic drifts often show values between 2 and 3, but 
we also noticed that many instrumental fluctuations were dom- 
inated b y 1/ f noise, i.e. following a spectral index a close 
to unity (Ossenkopfj, 120081) . In particular total-power drifts of- 
ten show very shallow spe ctra. Consequently, the results from 
ISchieder & Kramer! d200ll) cannot be directly applied. This re- 
quires a re vision with respect to gener alizing the spectral index. 
Moreover. ISchieder & Kramer! ([2001) assumed a special cali- 
bration scheme for all mapping modes where single lines in a 
map are combined with a single reference measurement for cali- 
bration, although other calibration schemes are possible as well. 
We had to evaluate the profit from exploiting the possibility to 
scan a series of lines in alternating directions before going to 
the reference position. Thus we have repeated their computa- 
tions in a more general framework allowing for various calibra- 
tion schemes and arbitrary spectral indices resulting in general 
guidelines for an optimum performance and calibration of map- 
ping observations. 

Apart from the optimization of individual OTF scans dis- 
cussed here, there exists a number of methods to reduce striping 
effects in OTF maps, in particular baseline offsets, either by an 
appropriate a posteriori data manipulation, assuming purely lin- 
ear drifts or special correlations in the observed structures, or 
by a combination of mul tiple maps observed in different scan- 
ning directions (see e.g . [Steer et all 1 1984t [Emerson & Grave, 
1984; iMaino et aUll999HAshdown et al.Ll2007l) . In fact both ap- 
proaches should be combined. Here we will focus on the op- 
timum observational setup which minimizes drift effects from 
the very beginning, so that the measured data for all individual 
data points show a high quality, independent of the number of 
coverages in which an object is observed, so that it is also ap- 
plicable to mappings of bright lines which are well detected in 
a single OTF map coverage. Altogether, the strategies discussed 
here should be 

The outline of the paper follows our basic approach to the 
optimization problem. In Sect. 2 we introduce the properties of 
the OTF mapping mode and discuss the possible ways how mea- 
sured data will be calibrated to obtain scientific data. In Sect. 
3 we evaluate the different calibration schemes with respect to 
their sensitivity to drift effects. In Sect. 4 we demonstrate the 
application of the different calibration schemes to actual obser- 
vations performed at the KOSMA 3 m telescope. From the best 
calibration scheme we optimize the exact timing of the observa- 
tions with respect to total noise in Sect. 5. Sect. 6 discussed some 
limitations of OTF modes and the conclusions for the observing 
mode efficiencies are summarized in Sect. 7. 



2. Introduction to OTF observations 

2.1. The general measurement scheme 




Direction of telescope motion OFF position 



Fig. 1. Sketch of an OTF observation. The dots symbolize the moments 
when the backends are read out. The integration starts when the tele- 
scope enters the rectangular area of the map. In this example, the OFF 
position is visited after every two lines and the scanning direction is 
alternating. 

A general introd ucti on to OTF mapping was given by 
iMangum etaf] (l2000l) and lBeuther et all (l2000h . The general se- 
quence of operations is demonstrated in Fig.Q] A map observa- 
tion is split into individual scans and their corresponding OFF 
measurements. Every scan consists of N source integrations ob- 
tained while continuously scanning the map. Each integration 
covers the time f s spent between the edge of the map and the 
first readout or between two subsequent readouts, symbolized by 
the dots in the picture. In the example, multiple lines are com- 
bined within one scan and a turn is performed between subse- 
quent lines so that they are scanned in opposite directions. At the 
end of a scan, the OFF position is visited and the reference mea- 
surement is performed with an integration time ?off- In principle, 
it is possible to go from an arbitrary position within the map to 
the reference position. We will show later that it is preferable to 
complete an integer number of lines in a scan. 

The measurable signal can be described by a continuous 
function s(t). For each source point i within a scan the integra- 
tion provides a source count rate 

l r*' 

c Sji = - dt s(t) (1) 

h J(i-l)t s 

if we assign t = to the start of the scan. Analogously, we can 
define the count rate for the OFF position as the average of the 
signal during the OFF integration. Because of the telescope mo- 
tion during the source integration time f s , the source count rate 
c Si ; correspond s to a broaden e d eff ective beam along the scan- 
ning direction. iBeuther et alJ (l2000h have shown that for OTF 
maps where the readout is performed on a spatial grid corre- 
sponding to a Nyquist sampling of the map, the effective beam 
broadening is less than 4 %. They assumed a 14 dB edge taper 
corresponding to a Nyquist sampling of FPBW/2.4. 

However, many observations are not performed exactly on 
Nyquist sampling, either by ignoring the difference between half 
beam width sampling and full Nyquist sampling or by using a 
common sampling for different tracers observed at different fre- 
quencies and consequently different beam widths. In Appx. A 
we show that the OTF mode does not provide a noticeable beam 
broadening as long as the data readout is performed on a time 
scale corresponding to a telescope motion of less than about 
0.65 HPBW. When observations ask for a lower spatial reso- 
lution, e.g. for the comparison of line ratios, or a coarser sam- 
pling, it is possible to integrate longer, thus reducing the noise. 
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Nevertheless, even in those cases, one should always prefer to 
integrate for shorter periods, if this is technically possible, to en- 
able a further analysis of the data with the full resolution. To 
foresee a later manual smearing to the goal resolution, the ob- 
servations then have to use a somewhat longer OFF integration 
time as discussed in Sect. [6j but this is usually justified by the 
gained flexibility. 

2.2. The calibration by a reference measurement 

The actual correction of the instrumental drift is done by sub- 
tracting the count rate obtained during the reference measure- 
ments at the OFF positi on from the count rate at the source points 
dKutner & UlicHll98lHOssenkopftl2002l) . 



■ CQFF 



(2) 



The radiometric noise in the calibrated data for each pixel 
thus consists of noise contributions from the source and from 
the OFF integrations, <x noise oc y/l/t s + 1/?off- In case of no dead 
times it can be easily shown that the radiometric noise for any 
total scan time N x t s + lfoFF is minimized when using an OFF 
integration time ?off = vNt s , if the N source integratio ns of 
one s can are calibrated with the same OFF measurement (Ball, 
Il976t) . Although this relation is not strictly fulfilled in the situa- 
tion of non-negligible overh eads, it remains approximately valid 
dSchieder & Kramer], 1200 lb and is thus widely used in current 
implementations of OTF observing modes at ground-based tele- 
scopes. 

However, it is not clear that the OFF measurement at the end 
of an OTF scan is always the optimum reference to be used in the 
subtraction. Alternatively, averages of different OFF integrations 
may be used as reference. Therefore, we consider general case of 
an arbitrary reference count rate cr with an effective integration 
time fR. We can distinguish three main calibration approaches: 
i) single OFF: The reference position is observed for fR = ?off 
before (or after) a series of N source points and the OFF count 
rate is subtracted from all source points in the scan: 



C S J — C s i — Cr 



(3) 



where the index i running from \ to N characterizes the differ- 
ent source points in a series. This approach is currently used as 
standard calibration for OTF observations at the JCMT, MOPRA 
and KOSMA telescopes. 

ii) interpolated OFF: The total OFF integration time /off is 
split into two OFF observations with half the integration time, 
fR = foFF/2, before and after the series of N source points. The 
reference count rate subtracted from each source count rate is 
given by the linear interpolation between the two OFF measure- 
ments 



C s ,i 



[(1 - Ocr.i + lc Rt2 ] 



(4) 



i.e. we use a new reference that is constructed from two OFF 
measurements with the half duration. Here, I is a time interpola- 
tion factor being / = if the source count rate is measured at the 
time of the first OFF observation and I — 1 if it is measured at 
the time of the second OFF. It can be obtained from 

l = f R /2 + f d ,i +(;- l/2)f s = f R /2 + f d ,i +(/- l/2)/ s 



fR + fd,l + fd,2 + Nh 



fR + h, 



where the terms fd,i and fj,2 stand for the dead times due to the 
telescope from the OFF position to the first source point and 
from the last source point to the OFF position. The number ; 



denotes the index of the source point in the current scan and 
fscan = fd.i + Nt s + fd,2 denotes the total duration of a scan. 

The actual timing of the observation can be almost identi- 
cal to case i because the OFF measurements between the scans 
are simply split into two subsequent OFF measurements with 
half duration. The only difference is that the whole observation 
is bracketed between two OFF measurements with fR = ?off/2. 
This approach is currently the default setting for the OTF cali- 
bration at the IRAM 30 m telescope. 

iii) double OFF: This approach uses the same splitting of 
the OFF measurement into two parts before and after the source 
series as case ii but uses the average of both count rates for the 
calibration instead of applying a linear interpolation in time: 



C s ,« 



1 



1 



2 CR * 1 + 2 CR ' : 



(6) 



It corresponds to Eq. (O with a fixed value I = 0.5. In this ap- 
proach all source points in a scan are calibrated with the fixed 
OFF count rate corresponding to the value at the center of the 
scan in the linear interpolation. This calibration scheme is avail- 
able as optional mode at several ground-based telescopes. 

For all following computations we will stick to Eq. (@}, 
which can be used for all calibration schemes when applying 
the appropriate weighting factors / and I — I. For the case of the 
single OFF / is set to 1 or 0, for the interpolated OFF Eq. (O is 
used, and for the double OFF I = 1/2. 

The obvious advantage of the linear interpolation (ii) is the 
complete cancellation of linear drifts. The disadvantage is the 
production of a varying noise across each series of source points 
resulting from the variable OFF contributions. In the center of 
each series, where I = 0.5, the noise from the OFF position cor- 
responds to an integration time of Toff, but at the ends, where 
/ = or I — 1, the noise from the OFF is higher by V2 because 
only a single measurement with ?r = foFF/2 contributes. This 
is actually visible in some IRAM observations where the noise 
is minimal in the center of each line but increasing towards the 
edges of the maps (Teyssier, priv.comm.). The effect is relatively 
small for long scans with ?off = "V^Vf s because the contribution 
from the OFF integration to the total noise is small. Its change 
by a factor V2 is hardly noticeable in most cases. 

2.3. Correlated noise 

The calibration of data from several source points with a com- 
mon OFF measurement always produces some artificial correla- 
tion in the final maps because the noise from the OFF measure- 
ment shows up in multiple points. When smoothing such a map 
to lower resolution the noise does not decrease with the square 
root of the number of points averaged, but the noise contribu- 
tion from the OFF will remain constant. We can describe this as 
"correlated noise given by the noise variance contribution from 
the OFF measurement and the number of map points affected by 
this contribution. From Eq. (|4]i we can see that the noise variance 
contribution from the OFF measurement to an individual source 
point follows 



ov, oc 



( 1 - if 

fR.l 



fR,2 



(7) 



assuming statistically independent noise in the OFF measure- 
ments involved. 

The different ways of adding the noise from the neighbor- 
ing OFF measurements in the different calibration approaches 
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result in a different amount of correlated noise throughout an 
OTF map. For the single-OFF calibration, with I = or 1 and 
?r = foFF we find a constant contribution from one reference 
measurement to all points of a scan. 

When splitting the OFF measurement between the OTF 
scans into two parts with half the integration time, ?r = ?off/2, 
the noise variance in each of these parts is twice the noise 
of the original measurement. For the double-OFF calibration 
with a fixed ratio 1-1/2 each source point inherits a quar- 
ter of the noise variance from each of the OFF measurements. 
Consequently their individual contribution falls at half of the 
level from the single-OFF calibration. The sum of their contribu- 
tions provides a noise level identical to the single-OFF treatment. 

When the calibration involves a temporal change as for the 
linearly interpolated-OFF calibration, the OFF noise will also 
vary from point to point. The linear interpolation between the 
two OFF measurements bracketing a scan with fR = foFF/2 cre- 
ates an OFF noise contribution that increases quadratically to- 
wards the boundaries of the scans. Compared to the single-OFF 
calibration the noise sum is higher by the factor 1 + 4(1 — 1 /2) 2 . 
The noise variance changes within a factor two matching the 
value for the single-OFF calibration at the scan center, but being 
two times as high at the boundaries. 

We can quantify the impact of this OFF noise as correlated 
noise within the map by the simple parameter o" 2 ombined , given by 
the product of the number of pixels showing the same noise con- 
tribution and the variance of this noise. This definition reflects 
the visual effect of correlated noise in a map where the eye auto- 
matically tends to integrate over parts of the map to detect struc- 
tures. The product of variance and pixel number corresponds to 
this integration. For all calibration schemes discussed so far, the 
correlated noise is restricted to single scans and can be approx- 
imated by integrating the individual noise contributions in Eq. 
(0 over the scan length ignoring the discretization of the scan in 
terms of source points. 

The single-OFF calibration and the double-OFF calibration 
produce the same correlated noise sum, as each point of a scan 
is treated with the same OFF noise variance in both cases. For 
the interpolated-OFF calibration, however, we find a correlated 
noise sum, a 2 . ■ ,, which amounts to only 2/3 of the value in 

' combined' J ' 

the single-OFF calibration, because the contributions from the 
individual noise measurements vary quadratically across a scan 
(Eq.[7]l. Although the average total noise sum amounts to 4/3 of 
the value from the single-OFF calibration we have a lower cor- 
related noise. The linear interpolation thus shows a slightly in- 
creased total noise but a decreased correlated noise contribution 
compared to the single-OFF scheme. 

An obvious further step towards an increase of the efficiency 
of the observations is the reuse of an OFF measurement for two 
two adjacent scans so that the OFF integration time can be re- 
duced by a factor two. When using the full OFF integration time 
for the reference time, i.e. ?r = ?off, all the equations from Sect. 
12.21 are still valid. The total noise variance is not changed, but 
we create additional correlations between the noise in different 
pixels. The correlated noise sum is increased by a factor two, be- 
cause the noise from one OFF measurement is spread across the 
two adjacent scans. For the interpolated-OFF calibration with a 
reuse of the OFF data for both adjacent scans the correlated noise 
sum and the average total noise variances are larger by a factor 
4/3 relative to the value obtained in the single-OFF calibration. 
Vice versa we can use an OFF calibration time of 2/3 of the time 
used in the single-OFF calibration to obtain the same amount of 
total noise and correlated noise in the interpolated-OFF calibra- 
tion. 



With respect to the radiometric noise we have thus the sit- 
uation that single-OFF and double-OFF calibration produce the 
same total noise contribution and the same correlated noise con- 
tribution from the OFF measurement to each source measure- 
ment.For the double-OFF integration we could reduce the OFF 
integration time by a factor two still maintaining the same total 
noise contribution, but at cost of a higher correlated noise. In 
the interpolated-OFF calibration we need only an OFF integra- 
tion time of 2/3 compared to the single-OFF calibration to obtain 
the same noise values, but on top of that we achieve a complete 
cancellation of all linear drift errors. 

After these general considerations we will actually compute 
the error in the calibrated data due to both instrumental drift and 
the radiometric noise both from the source points and from the 
OFF subtraction in the next section. 



3. The data uncertainty due to noise and drift 

3.1. Quantitative estimate 

The total uncertainty of the measured data is given by the sum of 
the uncertainties from radiometric noise and instrumental drifts. 
With known fluctuation spectra S (/) cc 1 /f a for the two con- 
tributions the total data uncertainty c an be computed as demon- 
strated by Sch ieder & Kramer! (12001 1) . They performed the esti- 
mate for the special case of a single-OFF calibration and spectral 
indices of the instrumental fluctuations a — 2, 3. Here, we repeat 
these computations for the general case. 

If we write the expression for the calibrated data (Eq. |2]i for 
an arbitrary start time t and the general calibration approach ex- 
pressed by the weighting factor I we obtain 

C s ,i = c M (0 - (1 - 0c R ,i(f) - /c R>2 (f) (8) 

= - I dt' s(t') 

h J/+/r+/ d j 

-{I -I)— dt' S(t')-l— dt' S(t') . (9) 

l R Jt f R J/+/ scim +/ R 

To abbreviate the notation we use here the total delay time be- 
fore a given source measurement i, fo,i = fd,i + - 1)4- In the 
same way we define ?d,2 as the total delay time after a given 
source measurement i, ?d,2 = fscan - ?d.i - k = ?d.2 + (N — i)t s . 
With the appropriate weighting factors I and I - I, the equation 
can be used for all calibration schemes discussed above. In all 
schemes where the OFF measurement is split into two separate 
contributions we use ?r = ?off/2, otherwise fR = ?off- 

Assuming ergodicity we can obtain the average total uncer- 
tainty of the count rate from a time-average 

cr 2 c (0 = ((C» - <C s ,,->,) 2 ) / (10) 

where we treat the measurement as a continuous function, ignor- 
ing that it is performed only in discrete steps. 

It can be easily seen that the maximum uncertainty occurs 
for weak signals where the count rates on the source and on the 
OFF position basically are the same. Thus we consider the worst 
case assuming (c s .,(f) - (1 - 0cr,i(0 - lcR^.(f))t = 0. Then the 
second term in Eq. ( fT0T > vanishes and we can rewrite it as 

cr 2 c d) = {csj(tf) t + (l-l)^c R , l (t) 2 } i + l 2 (c K2 (t) 2 ) t 
-2(1 - <c s ,,(?)cr,i(0>, - 2l(c sJ (t)c Ra (t)) t 
+2/(1-0<cr, 1 (0cr, 2 (0>, ■ (ID 
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The first three contributions contain the variation representing 
the noise within each of the three measurements involved. The 
other terms represent the cross correlation between them con- 
taining the mutual drift terms. 

The computation of all six terms follows the same approach. 
We demonstrate it here only for (c s j(t)cRj(t)) t : 

<c M (*)cR,i(0> t = dt' dt" s(t')s(t") .(12) 

's'R \Ji J'+'r+'d.i / 1 

With the coordinate transformation to time variables t' = t + fp> - 
t' and t" = t + ?r - t" we obtain 



series of data dumps as a function of the length of the inter- 
vals. A comprehensive introduction to this technique was given 
in paper I. The Allan variance spectrum can be computed in the 
same way as laid out above assuming zero delayfjFor I = 0, 
h — ?r = fbin, and fo,2 = ?d,2 = we can transform Eq. (fT6b into 



(c s ,;(f)CR,l(f)) f 



1 

f S ?R 



(13) 



f R dr f dr" (s(t - t'M? - r" + f D>1 + t s )) t 
Jo Jo 



where we have also exchanged the sequence of integration and 
time-averaging. The integrals can be evaluated using the auto- 
correlation function of the fluctuation spectrum. For a power-law 
noise spectrum S (f) cc 1 //<* with spectral indices < a < 3 the 
auto-correlation function can be evaluated as3 



y(r) = (s(t + T)s(t)) t 



(14) 



assuming zero averages (ISchieder & Kramerl 1200 lb . This re- 
lation and the properties of this kind of autocorrelation func- 
tions are exte nsively used in studies of fractal and turbulent 
processes (e.g. Peitsen & S^p3,[l988; Bu nde & HavTInlll994t 
lFriscR[T995h . 

Exploiting this relation, the integration can be carried out as 
demonstrated in Appendix B resulting in 

<c s ,;(0cR,i(f)> f = go- . 8 ", ff {te.i + h + t R ] a+l (15) 
a(a + l)t s t R 1 

-[f D ,l+f R f +1 -[fD,l+fsf +1 + /g?} 

We can perform the integration for all terms in Eq. (fTTb and 
obtain 



rtjO = Z7Z^{C 1 + (1 - 2/ + 2Z 2 )fg- 1 



+1(1 - 1) 



a(a + 1) 

(2fR + f S can) ff+1 _ 2(fR + f SC an)° +1 + £ 



(16) 



a+l j_ f a+l 
scan 



-(l - o 



(f R + ?d,i + f s r +1 -it* + f D ,i) ff+1 -(f D ,i + f s r +1 +c' 



r R ?s 

(f R + ?d,2 + t s y +i - (r R + tDa r +i - (f D , 2 + fs r +i + 

The first two terms contain the fluctuations within the source and 
the OFF measurements (Eq. IB.4I ). the third term represents the 
drift between the two involved OFF measurements and the last 
two terms characterize the drift between the source measurement 
and the two OFF measurements. 

The coefficient g a giving the amplitude of the fluctu ations 
can b e determined by an Allan variance measurement (Allan, 
119661) . The Allan variance measures the variance of the differ- 
ence of the signal between subsequent intervals in a long time 



ff+l ■ tcr+l 



3 For or = 1 there exists a logarithmic deviation so that equation does 
not hold for this particular value. 



2ga 



a(a + 1) 



4(2° 



bin 



(17) 



where fbin denotes the length of the data intervals. This allows us 
to express the uncertainty of the calibrated OTF data in terms of 
the Allan variance spectrum. 

However, the fluctuations of any signal are not only char- 
acterized by a single power spectrum but they consist at least 
of a superposition of white noise with a spectral index a = 
and an instrumental drift contribution with some steeper spec- 
tral index a. Fortunately, we expect no correlation between the 
radiometric white noise and the instrumental drift, so that both 
the Allan variance spectrum and the uncertainty of the calibrated 
data from an OTF observation simply are the sum of both con- 
tributions, cr\ = cr 2 AQ + o-\ a and <x^ = o"^ + cr^ a . The Allan 
variance of the white noise contribution is given by 



cr 



A,0 



#Flfbin 



(18) 



and the white noise contribution to the OTF measurement is 

2 (sV)%(l , l-2/ + 2/ 2 ) \ 
^0,0 = ~5 — 7 + : 1 < 19 ) 



OH \?s ?R / 

where Bpi denotes the fluctuation bandwidth of the radiometric 
noise. 

With the definition of the Allan time tA as the bin size fbi n 
where the drift contribution and the radiometric noise in the mea- 
sured Allan variance spectrum show the same magnitude (paper 
I), we can relate the radiometric noise to the coefficient g a . We 
obtain the coefficient of the drift contribution as 



4(2-! - l)Bn« 



(20) 



Finally we can compare the total uncertainty of the calibrated 
data cr 2 c {i) to the unavoidable uncertainty due to the radiomet- 
ric noise in an equivalent measurement with an ideal instrument 
without any drifts, in an ideal observation without the need for 
an OFF measurement. If we assume that this observation uses 
the total observing time for the points of an OTF cycle in a 
given map, f tot = ?off + f scan> me resulting data uncertainty is 



cr', 



C, ideal 



N(s(f))t 



(21) 



When we combine Eqs. d20l ) and (|2TT i to substitute g a , add 
the radiometric and drift noise contributions crL „ and cri and 

C,0 C,a 

normalize the resulting noise of the real OTF observation relative 
to the limiting ideal observation we obtain a measure for the 
actual impact of all instrumental effects on the data quality 



cr 1(f) 



C.ideal 



~~N 



1 1 
— + — 



21 + 2l 2 
xr 



(22) 



4(2 ff -! - 1) 



(1 -2l + 2l 2 )x a R l 



4 The original definition of the Allan variance by lAUarj ( fl966l) is 
lower by the factor 1/2. 
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where we have transformed all time scales relative to the Allan 
time ?a with x tot = ftot/^A, = ?s/*a and so on. 

We find two essential contributions: the first two terms char- 
acterize the radiometric noise of the observations. This noise is 
higher than the radiometric noise in the ideal observation due 
to the x R -term containing the noise from the OFF measurement. 
Without this term the radiometric noise ratio would be unity. All 
terms in the brackets characterize the drift contribution to the to- 
tal data uncertainty. The different terms stand here for the drift 
occurring during the different time lags involved in the measure- 
ment. The ratio between the drift noise and the radiometric noise 
of the observed data can be computed by simply dividing these 
two contributions. 



3.2. Comparison of the different calibration schemes 

With Eq. d22l) we can draw quantitative conclusions on the differ- 
ent calibration schemes. We have computed the data uncertainty 
(7^(0/(7^, idea] as a function of the scan length N, the spectral in- 
dex of the instrumental drift a, the position of a source point 
within the OTF scan i, the source point integration time x s , and 
the dead times between the OFF measurement and the source 
integrations in the scan. To avoid too many parameters in the 
following examples we simplify them by assuming that the two 
dead times for moving from the source to the OFF position and 
vice versa are the same, Xd,i = Xd,2- This is well fulfilled for most 
observations with the Herschel satellite and still a reasonable 
approximation for most ground-based telescopes. Moreover, we 
assume in this section that the total integration time on the OFF 
position follows the standard rule xoff = VNx s derived for an 
ideal telescope. 

In Fig. [2] we compare the three standard calibration schemes 
from Sect. 12.21 for an example scan consisting of = 10 
points. In the simulation a spectral index of the instrumental 
drift a = 2.5 was used, typical of spectro scopic fluctuations 
dSchieder & Kramerll200ltlOssenkopj,l2008l) . and the total dead 
time given as the sum of the dead times before and after an OFF 
measurement was assumed to be a quarter of the Allan time 
which is a typical value for many Herschel observations. The 
figure shows the normalized total noise RMS as a function of 
the position of a source point within the scan for different source 
integration times x s . 

For all three cases we find that the shortest integration time, 
x s = 0.01, results in a relatively high noise. This can be eas- 
ily understood by the low efficiency of this observation where 
only a short integration time is spent on the source but a large 
fraction of the total cycle is occupied by the dead times. In this 
relatively fast cycle no instrumental drifts are seen and the noise 
is barely varying across the scan. The variation of the radiomet- 
ric noise across the scan by up to 11 % for a 10-points scan in 
the interpolated-OFF calibration scheme discussed in Sect. 12.21 
actually is much lower because all 10 points are measured close 
to the center of the time interval between the two OFF measure- 
ments. 
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Fig. 2. Variation of the total data uncertainty across an OTF scan ob- 
tained in the different OTF calibration schemes. The RMS of the fluc- 
tuations is plotted relative to the RMS which would be obtained by an 
ideal instrument in the same total time. Part a shows the result from 
the single-OFF calibration with the OFF measured before the scan, b 
the double-OFF calibration, and c the interpolated-OFF calibration. A 
scan length N = 10, a spectral index a = 2.5, and a total dead time 
*<u + x d,2 = 0.25 were used. 



In contrast, we find a strong noise variation across the scan 
for all calibration schemes when the source integration time per 
point is in the order of the Allan time. All data are dominated 
by instrumental drifts also leading to a high total noise. An in- 
termediate integration time per source point results in the lowest 
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overall data uncertainty. In this example the optimum falls be- 
tween about 0. 1 Allan times for the single-OFF calibration and 
0.2 Allan times for the interpolated-OFF calibration. Further pa- 
rameter studies show that the optimum falls at shorter integration 
times for shorter dead times and for larger numbers of source 
points in a scan. 

Comparing the calibration schemes shows extreme differ- 
ences in the impact of the instrumental drift. The single-OFF 
calibration has a huge sensitivity to instrumental drifts. When 
using the OFF measurement before the scan for calibration, as 
shown in the figure, the total data uncertainty grows monoton- 
ically towards the end of the scan, for x s - 1 it exceeds even 
the selected plot range going up to a value of 1 1 . For the corre- 
sponding scheme using the OFF after the scan, the plot would 
be mirrored. The advantage of the interpolated-OFF calibration 
relative to the double-OFF calibration is also clearly visible. The 
latter has a much larger data uncertainty at the ends of the scan 
due to instrumental drifts. Compared to this uncertainty, the vari- 
ation of the radiometric noise from the OFF calibration in the 
interpolated-OFF scheme, well noticeable only for x s = 0.05, is 
a very small contribution. In the center of the scan double-OFF 
and interpolated-OFF calibration necessarily have to agree. For 
all cases with a noticeable instrumental drift, the interpolated- 
OFF calibration is superior to the double-OFF calibration. The 
latter one is slightly better for fast scans where instrumental 
drifts play no role. 

When performing the same computations for longer OTF 
scans, which are usually used when mapping large areas on the 
sky with ground-based telescopes, we find the same qualitative 
behavior as shown in Fig. [2] but an increase of all drift effects, 
due to the longer times covered, and a further reduction of the 
radiometric noise variation across the scan, so that this turns in- 
visible in the corresponding plots. An extended parameter study 
has shown that the data uncertainty due to drift effects grows 
with the spectral index of the fluctuation spectrum a and with 
the dead times before and after the OFF measurements. For 1 // 
spectra, we obtain a very weak dependence of the maximum drift 
noise on the integration time, the scan length or the dead time. 
For moderate scan lengths, $100 points, and integration times 
covering a noticeable fraction of the Allan time, the maximum 
total noise RMS is always approximately twice the ideal noise 
RMS. However, 1/f spectra are often also correlated with very 
short Allan times, then limiting the observations. One has to be 
aware that all time scales have to be considered relative to the 
Allan time. For steeper noise spectra, the drift provides the main 
limitation to the possible scan lengths. Dead time, Allan time 
and drift index are determined by instrument and telescope so 
that their design should be directed towards a minimization. The 
main prerequisite for any accurate mapping observation is a a 
low instrumental drift expressed by a long Allan time and/or a 
shallow drift index. 

One can still improve the observing efficiency by an appro- 
priate calibration scheme and an optimized setup of the obser- 
vations. The drift uncertainty is increased by a larger number of 
source points in each scan but reduced in the case of shorter inte- 
gration times per source point. The mutual optimization of these 
two parameters leads in general to the smallest uncertainties for 
scans with a large number of points but very short integration 
times. This may be limited, of course, by the size of the region 
to be mapped and data rate which can be taken with the instru- 
ment. A detailed optimization taking both effects into account is 
given in Sect. 15.11 

Looking at the overall pictures it is clear that the 
interpolated-OFF approach is in general the most robust one. 



The single-OFF calibration is easily disqualified compared to the 
other two schemes. The double-OFF calibration can be slightly 
better than the interpolated-OFF calibration if the integration 
time is very short and we have an accurate knowledge of the 
instrumental drift behavior. Taking the usual uncertainty and the 
statistical fluctuations of the actual drift behavior into account, 
leads us, however, to the general preference of the interpolation 
scheme. These results hold independent of the possible split or 
combination of the OFF integrations with respect to the neigh- 
boring scans (Sect. |2~3l as this would only affect the correlated 
noise sum, not changing the noise amplitude computed here. 



4. Application to observed data 

The different calibration schemes were tested using existing 
molecular line observations performed with the KOSMA 3 m 
telescope. An arbitrary OTF patch was taken f rom a larger sur- 
vey 1 3 CO 2-1 survey of the Cygnus X region dSchneider et all 
2006). The observations were taken in the ordinary OTF mode 
where after each line of the patch, containing 20 source inte- 
grations of 5 s, one OFF integration of 23 s, corresponding to 
V20 x 5 s, was performed. For the slew from the end of an OTF 
scan to the OFF position a dead time of 19 s was needed, the slew 
from the OFF position to the beginning of the subsequent OTF 
scan took 12 s. The spectral resolution of the used backend is 
360 kHz and the corresponding fluctuation bandwidth 560 kHz. 
The spectroscopic Allan time of the instrument at this resolution 
is about 120 s. The Allan time of the whole system including 
the atmosphere is estimated to be approximately 80 s. The drift 
index of the fluctuations falls between 2 and 3. For all computa- 
tions we assume 2.5 here. The single-sideband system tempera- 
ture during the observations was about 350 K. 

To emphasize the drift effects we first consider maps of line 
integrated intensities, where the full velocity range of the I3 CO 
line from 4 to 8 km/s was integrated corresponding to an effec- 
tive bin width of 2.9 MHz. As the binning reduces the radiomet- 
ric noise of the data, this corresponds to a reduction of the Allan 
time, where radiometric and drift noi se have equ al amplitudes. 
Following the formalism developed in lOssen kopf (2008) we can 
compute an effective Allan time at 2.9 MHz bin width of about 
30 s. One OTF cycle corresponds to approximately five Allan 
times at this resolution so that we expect to notice drift effects in 
the integrated maps. 

Figure [3] shows the integrated line maps obtained in the dif- 
ferent calibration schemes. We can compare the observed struc- 
ture with the noise computed from Eq. d22l . An ideal observa- 
tion, spending all the observing time for the map integration, 
would result in a radiometric noise of <x as 0.1 K (see Eq. |2"TV 
Due to overheads and the noise contribution from the OFF mea- 
surement, the actual radiometric noise is higher by a factor be- 
tween 1.31, obtained for the double-OFF calibration and in the 
scan center for the interpolated-OFF calibration, and 1.37 for 
the single-OFF calibration. The drift noise cr C drift varies across 
the 20 points of the scan between 0.65 and 0.72 cr Crad for the 
double-OFF calibration, between 0.56 and 0.65 cr Crad for the 
interpolated-OFF calibration, between 0.77 and 1 .45 cr Cra( j for 
the single-OFF calibration using the OFF before the scan, and 
between 0.82 and 1.49 cr Crad for the single-OFF calibration us- 
ing the OFF after the scan. While the drift noise should be hidden 
in the radiometric noise for the double-OFF and the interpolated- 
OFF calibrated data, it should be clearly noticeable in the single- 
OFF calibrated maps towards the ends of the scans which are 
most apart from the corresponding OFF measurement. 
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Fig. 3. Demonstration of the influence of the different calibration schemes on the appearance of the produced line maps. The maps in the upper 
panels were obtained when calibrating with the single OFF measurement before and after each line. The lower left panel shows the result with a 
fixed sum from the two adjacent OFF measurements (/ = 1/2) and the lower right panel shows the result when using a time interpolation between 
both OFF measurements. The map was measured in horizontal stripes. After each line an OFF measurement was taken. 
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Fig. 4. Autocorrelation function for the maps from Fig.[3]measured in 
the two perpendicular directions a and 6. To emphasize the scan direc- 
tion a, the symbols for that direction are connected by lines. For an 
isotropic structure both directions should show the same values. 



We can clearly recognize the very "stripy" structure in the 
two single-OFF calibration maps, but the stripes are not re- 
stricted to one end of the scans but cover large parts of the map. 
The drift effects are visible as global structures in the integrated 
line maps, but for each single source point they could still be 



hidden in the radiometric noise. We have to keep in mind that 
the Allan variance is only a statistical measure to characterize 
the drift behavior. Thus we cannot expect to find uniform drift 
effects in all scans but we will always find lines with stronger 
and weaker indications of instabilities. Eq. ( l22b only gives the 
lcr uncertainties of a stochastic process. 

We can quantify the "stripiness" of the maps by compar- 
ing variations in the map in the direction of the OTF scans and 
perpendicular to them assuming that the observed astrophysical 
structure will be more or less isotropic. Figure|4]shows the auto- 
correlation functions A(Ar) = {C(r)C{r + \r)) r /(C(r) 2 ) r when 
using Ar parallel and perpendicular to the scan direction for all 
four calibrated maps. For an isotropic structure the autocorrela- 
tion function should decay in both directions in the same way. 
We notice, however, significantly lower values of the autocorre- 
lation function measured in the ^-direction for all four maps at 
shifts of one or two pixels indicating variations due to the map- 
ping structure. The effect is largest for the single-OFF map using 
the OFF after the scan with the strongest stripes also visible by 
eye in Fig. [3] The maps resulting from the double-OFF calibra- 
tion and the interpolated-OFF calibration have almost the same, 
still significant, anisotropy which is, however, reduced by a fac- 
tor two compared to the single-OFF map with the OFF after the 
scan. 

The integrated line map discussed above is strongly suscepti- 
ble to drift effects due to the short effective Allan time. To better 
study the effect of correlated radiometric noise we can use single 
channel maps where the radiometric noise is higher and the rela- 
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Fig. 5. Channel map at 5.7 km/s obtained in the different calibration schemes from the same data as used for the integrated line maps in Fig.FJ] 
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Fig. 6. Autocorrelation function for the maps from Fig.[5]measured in 
the two perpendicular directions a and 6. 



tive drift contribution is lower. An ideal observation would give a 
noise of cr(f tot ) a 0.23 K in this map. With the overheads quoted 
above this corresponds to 0.30 K radiometric noise per point and 
the maximum drift noise contribution cr cdrift computed for the 
single-OFF calibration is only 0.44 cr Cla d.. Figure [5] shows the 
maps from the spectrometer channel at the peak of the average 
line profile obtained in the different calibration schemes. Figure 
[6] shows the corresponding autocorrelation functions measuring 
the anisotropy. We find, much smaller differences between the 
calibration schemes than in Fig. [3] but the differences are still 



dominated by drift effects and not by the different level of cor- 
related radiometric noise. Those lines showing drift signatures 
in the integrated maps are the same lines that show the smaller 
deviations in the channel maps although the radiometric noise is 
higher in the channel maps. The channel maps are much more 
isotropic than the integrated line maps, but the maps calibrated 
by a single OFF still show noticeable stripes. 

We can conclude that at least for maps with more that 10 
pixels, the effect of correlated radiometric noise is small so that 
the choice of the calibration scheme should be based on the ca- 
pability of correcting the drift of the system. The application of 
the different calibration schemes to real observations confirms 
the theoretical considerations that the single-OFF calibration is 
much more susceptible to drift effects than the other two calibra- 
tion schemes so that it should be avoided. The example showed 
no significant advantage of the interpolated-OFF calibration rel- 
ative to the double-OFF calibration. Both schemes lead to a 
strong reduction of the stripiness of the resulting maps. 

5. Global optimization 

5.1. Minimization of the total noise 

For any given calibration scheme, the formalism introduced in 
Sect. I3.ll can also be used to optimize the actual timing of the 
observations. We can adapt the scanning speed of the OTF ob- 
servations to provide source integration times resulting in a min- 
imum uncertainty of the calibrated data. 

This is demonstrated in Fig. Q where the total data uncer- 
tainty is plotted as a function of the source integration time for 
different scan lengths when the interpolated-OFF calibration is 
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Fig. 7. RMS of the total data uncertainty from radiometric and drift 
noise relative to the noise from an ideal instrument integrating the same 
total observing time. The noise is plotted as a function of the source 
point integration time relative to the Allan time for different numbers of 
points per scan, N. The interpolated-OFF calibration, a spectral index 
a = 2.5, and a dead time x,i i + x&p, = 0.25 were used here. 



used. A spectral index a = 2.5 typical of spectroscopic drifts and 
a total dead time of 0.25 Allan times were assumed here. The 
plot shows the maximum value across the OTF scans, which is 
typically reached at the ends of the scan for short cycles and in 
the center as soon as drift effects start to dominate. 

For all scan lengths we find a characteristic minimum cor- 
responding to the optimum source integration time. When us- 
ing longer integration times, drift effects start to dominate. At 
shorter integration times, the relative overhead from the slew to 
and from the OFF position reduces the observing efficiency so 
that the radiometric noise is too high. We also see that the rel- 
ative accuracy of the observations at the optimum timing grows 
with the scan size N. The equivalent plot for the double-OFF cal- 
ibration shows a steeper increase of the noise when the source 
integration time is above its optimum value but minima which 
are slightly deeper than the minima shown here. 

A special behavior occurs for very shallow spectra (a $ 
0.75). They show about the same slope at long integration times 
for all scan lengths so that the curves do not intersect. This means 
that very long scans are always favorable even if the resulting cy- 
cle length is much longer than the Allan time. This can be under- 
stood from the fact that in fluctuation spectra shallower than 1 //, 
the noise is further reduced with increasing integration time, just 
like in the familiar case of white noise, but with another slope. 
The exact value for the transition to this behavior depends on the 
dead times involved but the limit a = 0.75 is a good approxima- 
tion for most cases. 

Independent from the spectral index of the fluctuations we 
find that the best observing mode is always given by very 
long scans with many points and a very short integration time 
per point in each scan. A full observation is obtained from 
many of these short-time cov erages. This was already shown 
by ISchieder & Kramer (2001). Unfortunately, there are always 
practical limitations to this approach. A telescope cannot move 
arbitrarily fast and the integrated data cannot be read out and 
dumped at an infinite data rate. Thus, the minimum relative inte- 
gration time x m i n set by the instrument is a limiting quantity for 
the optimum OTF timing. Moreover, the size of the astronomi- 
cal source naturally constrains the scan size N. Small maps may 



consist of a limited number of points N mdx only. For any given 
Xmin and A^ max , a plot like Fig.|7j computed for the actual slewing 
time + Xd,2, can be used to obtain the optimum setup. In most 
cases, the solution will still fall at the extreme provided by the 
maximum possible number of points and the minimum possible 
integration time. 

At a number of ground-based telescopes (e.g. JCMT, 
MOPRA, KOSMA) the implemented OTF mode identifies the 
OTF scan length N with the length of a single line in an OTF map 
Mine. However, there is no a priori justification for this identity. 
In a more general approach, used e.g. at IRAM and APEX, mul- 
tiple lines are combined within one OTF scan. An even more 
generalized approach is foreseen for the pointing mode defini- 
tions of the Herschel Space Telescope where an arbitrary number 
of points is measured between two OFF measurements, resulting 
in scan sizes that may cover also parts of map lines. This is par- 
tially motivated by the relatively slow slew to the OFF position 
by the telescope. Here, we consider this most general approach. 
When combining multiple map lines in one OTF scan the turn- 
around delay between subsequent lines, r turn , has to be taken 
into account when computing the total noise in the data. For a 
point i measured within an OTF scan of length N, the number 
of turns before and after this point are A'tum.i = _ 1 Valine and 
Mum,2 = (N — 0/Mine, respectively. In Eq. (1221 . the total delay 
before the measurement xd,i has to be increased by A^m-ru-Xtum, 
the total delay after the measurement xd,2 by AWa-Xtum, and the 
total scan length x scan by (AVn.i + N tum ^)x tum , where x turn de- 
notes the turn time relative to the Allan time, x turn = f tU rn Aa- 
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Fig. 8. Relative RMS of the calibrated data in OTF observations as a 
function of the scan length and the integration time per source point 
relative to the system Allan time. Values of more than 10% above the 
minimum are clipped in the plot. The asterisk marks the optimum setup 
resulting in the minimum noise at N = 180 and x s = 0.028. A line 
length of 30 points, an OFF dead time x^i + X42 = 0.6, a turn time of 
the telescope x lurll = 0.15, and a spectral index a = 2.5 were used here. 



In this case, the minimum of the relative data uncertainty is 
no longer found at the maximum scan length and the minimum 
possible integration time because of the increasing overhead for 
turns with increasing scan length. The optimization has to be 
done by actually evaluating Eq. (l22l for different scan lengths 
and integration times. This is demonstrated in Fig. [8] showing 
the total noise RMS as a function of source integration time and 
scan length for a map where A^e = 30 and the relative inter-line 
overhead is x tura = 0.15. 
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The most important feature in this plot is the large extent of 
the valley. Within the whole colored part of the plot, the noise 
RMS only changes by 10 % and even a 2 %-contour encloses a 
factor six in scan length and a factor two in the integration time. 
This means that OTF observations are extremely robust with re- 
spect to bad timings. Even in setups far from the optimum, the 
noise RMS is typically only enhanced by some ten percent. This 
explains why the OTF mode is used very successfully at many 
ground-based telescopes without a thorough theoretical analysis. 

The staircase structure of the contours reflects turns which 
are added when the scan length exceeds multiples of the line 
length. We find several minima with their deepest points always 
at integer multiples of the line length. In this example, the ab- 
solute minimum falls at a scan length of 180 points, but scans 
with 90, 120, 150, or 210 points practically are not worse. In nu- 
merous tests with parameters typical of different telescopes we 
found no case with a minimum not falling at an integer multiple 
of full lines. Thus we can always complete lines in an OTF scan 
before going to the OFF position and it is typical that one can 
combine several lines in an OTF scan. To optimize actual obser- 
vations, it is thus sufficient to evaluate Eq. ( 1221 for scan sizes 
being integer multiples of the line length. 

With the additional overhead given by the dead time between 
the OTF lines, the optimum source integration time is no longer 
automatically given by the minimum time allowed by the instru- 
ment. We find an optimum source integration time x s between 
0.02 and 0.04, corresponding to a full scan duration of about 
seven Allan times. This is a contradiction to the general wis- 
dom, valid for symmetric observing modes, that the reference 
cycle period should be shorter than the Allan time. It can be eas- 
ily explained by the fact, that the Allan time always compares 
radiometric and drift noise, but only a small fraction of the full 
period is used to integrate down the radiometric noise for any in- 
dividual map point. However, this results in a general warning. 
To make sure that the proposed optimization scheme is actually 
valid, the Allan variance spectrum must not only be known up 
to a few Allan times, but it has to be determined over at least 
the time scale expected for the longest OTF observing cycles. 
This long term spectrum needs to be used the measure the sta- 
bility time and to fit the drift power law to use the optimization 
formalism derived here. 

Eq. d22b can also be used to check the optimum integration 
time for the OFF measurement. A rough estimate in Sect. l2.3l had 
shown that for the interpolated-OFF calibration scheme an OFF 
integration time of foFF = 2/3 x yfNt s should be sufficient. By 
introducing q as a free parameter characterizing the relative OFF 
integration time ?off = q we can include it in the optimiza- 
tion. Searching for the global minimum in the three-dimensional 
parameter space spanned by N, x s , and q, we find an optimum 
value of q — 0.69 for the parameters used in Fig. [8] a number that 
is close to the theoretical value from Sect. 12.31 The minimum is, 
however, very broad in the ^-dimension so that the exact choice 
of the OFF time has hardly any influence on the total data uncer- 
tainty of the calibrated data. We find the same kind of robustness 
as for the scan lengths. Varying the model parameters showed 
that the optimum scan length depends strongly on the spectral 
index of the fluctuations, but that the optimum ^-parameter is 
always close to 0.7. This value can be used for all observations 
applying the interpolated-OFF or the double-OFF scheme if the 
full integration time of both OFF measurements is used in the 
calibration. 

To provide a feeling for the results, we give in Table Q] a 
few realistic examples. We compare the optimum timing and 
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Table 1. Examples for parameters of an optimum timing in an OTF 
map covering 30 points in each line. All values are given for a spectral 
resolution of 1 MHz, corresponding to a typical fluctuation bandwidth 
of 1.5 MHz. 



observation t& a 
M 


N t s a/a(t lot ) ^ nft /< dl0 
M 


Ground-based telescope: t sm m = 1 s, ?d,i + h\,2 = 20 s, t tum = 8 s 


spectroscopic 80 2.5 
total-power 8 1.5 


180 2 1.17 0.07 
60 1 1.70 0.70 


HIFI at Herschel: f,_ m j n = 1 s, t^i + = 80 s, t tum = 20 s 


spectroscopic 150 2.5 
total-power 10 1.5 


180 4 1.23 0.10 
150 1 2.13 1.03 



the resulting noise for typical parameters of a ground-based 
telescope and of the HIFI instrument at the Herschel Space 
Telescope. All values are given for a reference resolution of 
1 MHz, which is typical of many high-frequency observation, 
but not for mm observations of Galactic sources. For such obser- 
vations the numbers may only apply to binned or line-integrated 
data. Ground based telescopes have the general advantage that 
they can quickly slew to the OFF position and store the measured 
data at a high rate. Their big disadvantage is the atmospheric in- 
stability resulting in a relatively short Allan stability time for 
spectroscopic and continuum measurements. For HIFI observa- 
tions with the Herschel Space Telescope we expect a more sta- 
ble configuration, with an Allan time of about 150 s for spectro- 
scopic drifts and of about 10 s for total -power drifts (see Sect. 
[T). On the other hand, all telescope slews are relatively slow so 
that we can estimate a slewing time of about 40 s when going to 
an OFF position which is 20' apart from the source. 

For spectroscopic observations, being only sensitive to rela- 
tive variations of the sensitivity across the spectrometer, we find 
in both cases an optimum scan length covering six full lines and 
■*s,opt ~ 0.025 which translates into optimum integration times 
per source point of 2 s and 4 s, respectively. The full range of 
good observing parameters covers again scan lengths and inte- 
gration times differing by up to a factor two from the optimum 
values. The drift contribution to the total noise is small in both 
cases, but the ground-based telescope is clearly superior in terms 
of the total noise per observing time due to the shorter dead 
times. For the Herschel observations the additional complexity 
of scanning subsequent lines in opposite directions is well jus- 
tified because a limitation of the scan length to the line length 
would increase the optimum noise RMS by 9% corresponding 
to a 19% loss of observing efficiency. In observations for simul- 
taneously determining the continuum level, where total-power 
drifts become relevant, the optimum integration time per source 
point falls below the 1 s minimum readout time for both configu- 
rations, so that the minimum time provided by the instrument has 
to be used. For a 1 s readout and the corresponding optimum scan 
length we find a significant drift contribution with an amplitude 
of 70 % of the radiometric noise for the ground-based example 
and 103 % for HIFI at Herschel. This means that the baseline 
uncertainty is as large as the radiometric noise contribution. The 
overall noise efficiency of the observations is low, relative to an 
ideal instrument we obtain 35 % for the ground-based and 22 % 
for the Herschel continuum observations. Using the OTF map- 
ping mode for an accurate determination of the continuum level 
with heterodyne instruments is therefore questionable. 
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6. Discussion 

One has to keep in mind that OTF observations in general and the 
optimization scheme proposed above in particular have a serious 
drawback. When reusing the calibrated data for purposes which 
were not foreseen when planning the observations, by spatial or 
spectral rebinning, the relative gain in the noise reduction is al- 
ways lower than for pure radiometric noise. In spatial rebinning 
the contribution of th e correlated noise fro m the OFF measure- 
ment stays constant dBeufher et all 120001) . By extending OTF 
scans over multiple lines of a map this effect is in principle even 
more enhanced. On the other hand, the total noise contribution 
from the OFF measurement drops with increased scan lengths so 
that the effect of correlated noise is also somewhat reduced by 
extending the scans. When applying the temporal optimization 
scheme the same becomes true for the spectral rebinning. The 
Allan time used to optimize the observations is determined by 
the ratio of drift noise and radiometric noise. Rebinning spectra 
to a coarser resolution only reduces the radiometric noise so that 
the relative contribution of the drift noise is enhanced. 

Thus, OTF maps have always a limited use with respect to 
spatial of spectroscopic rebinning. In both cases artificial struc- 
tures due to instrumental drifts or due to the correlated noise are 
enhanced. Consequently, a very careful planning of the observa- 
tions has to be performed. The observer has to find a compromise 
between the efficiency of the observations and their re-usability. 
The optimization scheme should always be applied at the level of 
the coarsest spatial and spectral resolution that might be used for 
interpreting the calibrated data. The actual data taking can hap- 
pen at a much higher spatial and spectral sampling. For the high 
sampling the observations will be less efficient, but the planning 
then guarantees that no artifacts will be produced by the fore- 
seen smoothing. For example, if the observations are taken on a 
Nyquist sampled grid but a smoothing to a half-sampled grid, i.e. 
a reduction of the number of independent points by a factor four, 
is foreseen the OFF integration time should be approximately 
doubled compared to the case where the Nyquist sampling rep- 
resents the spatial goal resolution. Then all artifacts from the 
observing mode are suppressed, however, at the costs of the ob- 
serving efficiency. The more precise the scientific application of 
the measured data can be specified in terms of spatial and spec- 
troscopic resolution the better can the actual observing scheme 
be adapted to the application resulting in more efficient observa- 
tions. 

7. Conclusions 

In most cases mapping observations should follow the scheme 
known from OTF maps where the calibration of several source 
points uses a common OFF measurement for reference. This is 
far more efficient than all other reference modes. The efficiency 
can be further enhanced by combining multiple lines with one 
OFF measurement. This introduces, however, correlated noise 
across the calibrated map stemming from the common OFF in- 
tegration. The impact of this correlated noise can be reduced 
by using the two neighboring OFF measurements as reference. 
Their optimum integration time is approximately 0.7 yNt s . 

In most cases the calibration of the source data should follow 
the interpolated-OFF scheme where the data from both neigh- 
boring OFF measurements are weighted according to their tem- 
poral distance from the source measurement. This compensates 
all linear drifts of the instrument and results in the lowest to- 
tal uncertainty of the calibrated data. The single-OFF calibration 
still used at several telescopes should be immediately abandoned 



because of the strong sensitivity of the calibrated data to drift ef- 
fects. For short scans with less than 10 points at a fast telescope 
the double-OFF calibration is superior to the interpolated-OFF 
calibration. However, as soon as drift effects may become im- 
portant, the robustness of the interpolated-OFF scheme turns it 
superior. 

The total uncertainty of the calibrated data consisting of ra- 
diometric noise and drift noise can be computed when the fluc- 
tuation spectrum of instrumental instabilities is known, i.e. an 
Allan variance measurement was performed. For a known spec- 
tral goal resolution, the result can be used to optimize the time 
line for the actual realization of the mapping observations. It 
turns out that the OTF observing mode is in general very robust 
with respect to non-optimal timings. The scan length and the 
source integration time can be varied within a relatively broad 
range without increasing the total noise in the calibrated data by 
more than a few percent. 

The optimization reveals some general relations on condi- 
tions for accurate and efficient mapping observations: 

- The efficiency of all mapping modes grows with growing 
map size. 

- The possibility of fast data readouts is in many cases essen- 
tial to minimize the drift contributions. 

- In most conditions OTF scans can consist of integer multi- 
ples of complete map lines. 

The most essential impact on the data accuracy is provided 
by the system stability. All intervals have to be considered rel- 
ative to the Allan time. The main prerequisite for any accu- 
rate mapping observation is thus a long instrumental stability, 
as measured by the Allan time. Due to the low gain stability of 
most heterodyne instruments it turns out that it is often impos- 
sible to derive significant information on the continuum level of 
astronomical sources using the OTF or raster mapping modes. 
They are always heavily influenced by the instrumental drifts. 

Both the general design of the mapping modes with a com- 
mon OFF measurement and the temporal optimization limit the 
re-usability of the data with respect to spatial or spectroscopic 
rebinning. The setup should be optimized with a clear picture of 
the resolution requirements set by the scientific goal of an obser- 
vation. 
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Appendix A: Effective beam broadening in OTF 
maps 

Often OTF observations are not be performed exactly on Nyquist 
sampling. Traditionally the difference between a sampling at half 
the beam width, HPBW/2, and a full Nyquist sampling is ig- 
nored by using the slightly coarser sampling. For a comparison 
of different tracers, it is moreover useful to map them at the same 
raster, even when the observation at different frequencies leads to 
a slightly different sampling with respect to the telescope beam. 
Thus it is very common to use samplings deviating from a full 
Nyquist sampling. We can compute the quantitative impact of 
the beam smearing in the general case by the numerical convo- 
lution of a two-dimensional Gaussian beam profile with a strip 
function of finite size representing the motion of the telescope 
during the integration. The result is shown in Fig. IA. 1 1 The solid 
line shows the increase of the half -power beam width (HPBW) 
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Fig. A.l. Beam broadening due to the scanning motion of the telescope 
during integration in OTF observations. The solid line shows the ratio 
between the HPBW of the effective beam in scanning direction and the 
original Gaussian beam. The dotted line represents the ratio between 
the corresponding equivalent widths and the dashed line the ratios of 
the standard deviations. 



with increased integration length. We find that the beam broad- 
ening goes from the mentioned 4 % at the Nyquist sampling of 
0.42 HPBW to 6 % at 0.5 HPBW and to 25 % at 1 HPBW. When 
using an integration length above 2 HPBW, the beam is com- 
pletely dominated by the strip length. At 2 HPBW, the original 
beam contributes only by 2 %. The beam becomes less and less 
Gaussian. As a measure for the distortion of the beam shape we 
have also plotted the ratio between the standard deviation of the 
actual beam and the original beam. It grows much slower than 
the HPBW. The beam shape is close to Gaussian for scan lengths 
below 1 HPBW and almost rectangular above 2 HPBW. For an 
arbitrary beam shape the actual resolution is better described by 
the eq uivalent width of the beam, eq = j P(8)d6/P m:iK , dKrausL 
1980). This is shown as dotted line in Fig. IA.1I It follows the 
curve for the HPBW for integration lengths below 1 HPBW, but 
is about 5 % lower at long integration lengths. The beam size 
perpendicular to the scanning direction is never affected by the 
OTF observing scheme. 



<c,(f)Q(f)>, = so - -V f 1 [It' + d\ a + (T'+d + hY] 
hh Jo a 

a(a + l)fi?2 L 



(i+i 



■\t 2 +d\ a+l +\d\ a+l ] (B.3) 



The correlation between the signals measured in t \ and de- 
cays basically with the total time spanned by the two integrations 
and the delay between them (first term in brackets), reduced by 
some corrections for the finite integration times. 

One special case is the auto-correlation over the same time 
interval, i.e. h~h~ ~d- Then we obtain the simple expression 



(cdtf) t 
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Another important case is the correlation measured in Allan vari- 
ance measurements with t\ — to, d — 0. The correlation between 
two adjacent intervals of the same length is 



(ci(f)c 2 (t)) t =go~ 



2g a (2 a - l)^- 1 



(B.5) 



a(a + 1) 

For the case of white noise, a = 0, Eq. ( fTrjj i does not hold, 
but the correlation function represents a Dirac ^-function, y(f) = 
go6(\r\). We obtain 
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This means that a noise correlation occurs only within the period 
of overlap of the two measurements given by the negative delay, 
-d. For all positive or zero delays, like in the Allan variance 
measurement, the correlation vanishes. The special case of the 
auto-correlation over the same time interval, leads to the well 
known radiometric noise behavior ^ci(f) 2 ) ( = go/h ■ 



Appendix B: Signal correlations from power-law 
autocorrelation functions 

Noise functions exhibiting a l/f a power spectrum (0 < a < 
3, a + 1) are characterized by even power-law autocorrelation 
functions 

y{r)=g -g a \r\ a - 1 . (B.l) 

The average correlation of the signal measured over a pe- 
riod 1 1 with the signal measured over separated by an arbitrary 
delay d can be computed as 

<ci(f)c 2 (f)) f = — f 'dr' f ~dr"y{T - t" + d + f s ) (B.2) 
hh Jo Jo 

= _L f V C' dT " f gQ _ | T ' - T "+d + t s r 1 )- 
hh Jo Jo 

We can always chose t\ to start before or at the same time as 
h, so that ?i + d > 0. Then we obtain 



Appendix C: Raster map observations 

Raster map observations differ from the OTF observations dis- 
cussed in the main part of this paper by pointing individually 
at the different source map points instead of continuously scan- 
ning over the source. This has two effects. First, the effective 
beam of the observation is always equal to the actual telescope 
beam. It does not suffer from the beam broadening discussed for 
OTF maps in Sect. 12. 11 Second, it introduces dead times between 
the observation of different points of a map. The observation of 
each source point is characterized by two time constants here, 
the source integration time f s and the slew time to the next map 
position f m . For all points in the map, except for the last point 
of a scan, the total time needed for the measurement is given by 
fs.tot = fs + fm- No additional turn time between two map lines 
is required. If we redefine the slew time to the OFF position as 
t'i 2 — f d,2 - fm we can use all equations derived above for the 
calibration and the noise estimate in the OTF mode by using t s 
whenever the integration time counts and f s tot whenever delays 
enter. 
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In particular the interpolation measure / derived in Sect. 12.21 
(Eq. [5} turns into 

l _ W2 + f d ,i + (1- l/2)f s , tot 

fR + *d,l + t' d2 + ^ f s,tot 

For the estimate of the total noise in the data Eq. d22l can 
still be used when the total delays include the additional slew 
times, i.e. 

Xd,\ = x d ,i+ (i - l)x Sjtot 

XD,2 = *d,2 + (N- i)x sM = x d 2 + {N - z)* s ,tot + *m 

Xscan = Ml + Nx s ,uyi + x' d2 (C.2) 

The resulting general behavior corresponds to an OTF map 
with very long delays before and after the lines. The corre- 
sponding optimum timing may consist of scan lengths which are 
shorter than the line lengths but there are no qualitative differ- 
ences to the properties discussed for OTF observations. 
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